CDS
Accession Number | TCMCG064C35141 |
gbkey | CDS |
Protein Id | XP_011102331.1 |
Location | complement(join(5491..5691,7754..7903,8380..8579,9414..9540,11059..11169,11543..11614,11699..11788,11863..11973,12105..12659,12848..13795)) |
Gene | LOC105180373 |
GeneID | 105180373 |
Organism | Sesamum indicum |
Protein
Length | 854aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA268358 |
db_source | XM_011104029.2 |
Definition | DNA mismatch repair protein MSH2 isoform X2 [Sesamum indicum] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGTTTGAGACAGTGGCCCGAGACGTTCTTTTGGAGAGGGCTGATCACACTCTAGAATTGTATGAGGGCACTGGAGCAAATTGGAGACTTGTGAAGAGTGCAACCCCTGGAAATTTAGGCAGTTTTGAAGAAATTCTGTTTGCTAATAACGAAATGCAAGACTCTCCAGTGATTGTGGCTCTTATTGCAAATTTCCGTGAAAATGGATGTGCTGTTGGCTTGAGTTATGTTGACCTTACTAAGAGGGTGCTTGGATTGGCAGAATTTCCTGATGATAGCCACTTCACAAATTTGGAGTCAGCTCTTGTTGCACTTGGTTGCAAAGAAATCCTTTTGCCTGTAGAGGTTGCCAAATCTAGTGAATATAGATCACTAAATGATGCATTGTCCAGGTGTGGTGCAATGGTAACTGAAAGAAAGAAAGCTGAGTTTAAAGGAAGGGATTTGGTACAAGATCTTGGTAGGCTCGTGAAAGGGTCTATGGATCCAGTAAGAGATCTCCTTGCTGCATTTGAACTTGCACCTGCGGCTTTGGGGTGTATAATGAGTTATGCAGACCTACTTGCAGATGAGAGCAATTATGGTAACTACAAAATCCAGAGATACGATCTTGCTAGGTACATGAGACTGGATTCTGCTGCCATGAGGGCACTGAATGTCATGGAGAGCAAAGCTGATGCTAACAAAAACTTCAGCTTATTTGGTCTTTTGAATAGAACCTGTACTGCAGGAATGGGCAAGCGTTTACTGCACATGTGGCTGAAACAACCTTTATTGGATGTAAATGAAATAAACTGTAGACTGGATTTGGTACAAGCTTTTGTGGAGGATGGGGCACTACGCCAAGATCTAAGGCAGCAATTGAAAAGGATTTCAGATATGGAGCGACTGACGCGGTCACTCGAGAAGAAAAGAGCAAGTCTTGTGCATGTTGTTAAGCTTTATCAGTCAAGCATCAGACTTTCCTTTATCAAAAGCGCACTGGAGCAGTACAATGGCCAATTTGCTTCATTGATCAAGGAAAGATATTTGGATCCTTTAGAAAACTGGACTGATGATAACCATCTGAATAAGTTCATTGGTCTTGTGGAAGCTTCTGTAGACCTTGAACAACTTGAAAATGGAGAGTACATGATTTCATCAGGATATGATTCACAGTTATTGGCTCTCAAAAATGAACAAGAGTCTCTGGAACATCAGATTCATGATTTGCACAGAAAAGCAGCTAATGATCTTGATCTGGCTCTTGATAAAGCTCTCAAATTAGAAAAAGGGACACAACATGGATATGCCTTTAGGATTACGAAAAAGGAGGAGCCAAAAGTACGGAAGAAGCTGAATACCCAATTTATTCTTATTGAAACTCGCAAGGATGGGGTAAAATTCACAAACATAAAGCTTAAGAAACTAAGTGAGCACTACCAGAAGGTAGTTGAAGAATATAAGAACTGCCAGAAAGAATTGGTTGCTAGAGTGGTCCAAACTGCTGCAACTTTCTCCGAGGTGTTTGAAGGAGTAGCTTGGTCGCTCTCAGAATTGGATGTTTTACTTAGTTTTGCTGATGTGGCTGCTAGCTCTCCAACTCCTTACACACGGCCACTCATCACTCCATTGGATGAGGGAGATATTATTTTGGAAGGGAGTCGGCATCCTTGCGTTGAAGCTCAAGATTGGGTGAACTTTATCCCGAATGATTGTAAACTGGTTAGGGGGAAAAGTTGGTTCCAGATTATTACAGGACCAAACATGGGGGGAAAATCAACGTTCATACGACAGGTTGGTGTGAACATTCTGATGGCACAAGTTGGTTCCTTCATACCGTGCGATAATGCTAGTATTTCTGTTCGTGATTGCATTTTTGCTCGTGTGGGTGCTGGTGACTGCCAGCTACGAGGAGTTTCTACTTTCATGCAAGAGATGCTTGAGACTGCATCAATCTTGAAAGGAGCAACTAAGAGGTCGCTAATAATAATAGATGAGCTAGGTCGTGGCACGTCAACGTATGATGGATTTGGCTTAGCATGGGCCATTTGTGAGCACATAGTTGAAGTGATAGAAGCACCTACACTGTTCGCTACTCACTTTCATGAGCTGACTGCATTAGCTCATGAAAATGCTCATGAGCAATCTTCAAAGAAATTTATAGGTGTAGCGAATTATCATGTGAGTGCACATGTTGACTCGTCAACTCGCAAGCTTACCATGCTTTACAAGGTTGAACCCGGAGCCTGCGATCAAAGTTTTGGTATACATGTTGCTGAATTTGCTAACTTTCCAGAAAATGTTGTTGCCCTTGCCCGAGCAAAGGCTTCTGAGTTGGAAGATTTTTCACCCATCACAATTGTGGCCCCTGATGCCAAAGAGATGGGTTCTAAAAGGAAGCGAAATTGGGACCCTGATGATGTACATAGGGGCACTGAACGAGCTCGCCAGTTCTTGAAGGACTTCTCTGAGTTGCCACTGGACAAGATGGATCTGAAGCAAGCGCTACAACACATCAGCAAATTGAAAGCTGACTTGGAGAAGGATGCAGTTAGCTGTTCCTGGCTCCAGCAATTCCTCTAG |
Protein: MFETVARDVLLERADHTLELYEGTGANWRLVKSATPGNLGSFEEILFANNEMQDSPVIVALIANFRENGCAVGLSYVDLTKRVLGLAEFPDDSHFTNLESALVALGCKEILLPVEVAKSSEYRSLNDALSRCGAMVTERKKAEFKGRDLVQDLGRLVKGSMDPVRDLLAAFELAPAALGCIMSYADLLADESNYGNYKIQRYDLARYMRLDSAAMRALNVMESKADANKNFSLFGLLNRTCTAGMGKRLLHMWLKQPLLDVNEINCRLDLVQAFVEDGALRQDLRQQLKRISDMERLTRSLEKKRASLVHVVKLYQSSIRLSFIKSALEQYNGQFASLIKERYLDPLENWTDDNHLNKFIGLVEASVDLEQLENGEYMISSGYDSQLLALKNEQESLEHQIHDLHRKAANDLDLALDKALKLEKGTQHGYAFRITKKEEPKVRKKLNTQFILIETRKDGVKFTNIKLKKLSEHYQKVVEEYKNCQKELVARVVQTAATFSEVFEGVAWSLSELDVLLSFADVAASSPTPYTRPLITPLDEGDIILEGSRHPCVEAQDWVNFIPNDCKLVRGKSWFQIITGPNMGGKSTFIRQVGVNILMAQVGSFIPCDNASISVRDCIFARVGAGDCQLRGVSTFMQEMLETASILKGATKRSLIIIDELGRGTSTYDGFGLAWAICEHIVEVIEAPTLFATHFHELTALAHENAHEQSSKKFIGVANYHVSAHVDSSTRKLTMLYKVEPGACDQSFGIHVAEFANFPENVVALARAKASELEDFSPITIVAPDAKEMGSKRKRNWDPDDVHRGTERARQFLKDFSELPLDKMDLKQALQHISKLKADLEKDAVSCSWLQQFL |